Learning Bayesian Network Classifiers for Credit Scoring Using Markov Chain Monte Carlo Search
نویسندگان
چکیده
In this paper, we will evaluate the power and usefulness of Bayesian network classifiers for credit scoring. Various types of Bayesian network classifiers will be evaluated and contrasted including unrestricted Bayesian network classifiers learnt using Markov Chain Monte Carlo (MCMC) search. The experiments will be carried out on three real life credit scoring data sets. It will be shown that MCMC Bayesian network classifiers have a very good performance and by using the Markov Blanket concept, a natural form of input selection is obtained, which results in parsimonious and powerful models for financial credit scoring.
منابع مشابه
A Novel Learning Algorithm for Bayesian Network and Its Efficient Implementation on GPU
Computational inference of causal relationships underlying complex networks, such as gene-regulatory pathways, is NP-complete due to its combinatorial nature when permuting all possible interactions. Markov chain Monte Carlo (MCMC) has been introduced to sample only part of the combinations while still guaranteeing convergence and traversability, which therefore becomes widely used. However, MC...
متن کاملMachine Learning, Information Retrieval, and Record Linkage
Classification into groups using terms available in the data underlies machine learning, information retrieval, and record linkage. Classifiers such as Bayesian networks in machine learning and term weighting in information retrieval depend primarily on training data sets for which truth is known. These classifiers may be relatively slow to adapt to new situations in which new data have charact...
متن کاملBayesian Analysis of Finite Poisson Mixtures
Finite Poisson mixtures are widely used to model overdispersed data sets for which the simple Poisson distribution is inadequate. Such data sets are very common in real applications. In this paper we investigate Bayesian estimation via MCMC for finite Poisson mixtures and we discuss some computational issues. The related problem of determining the number of components in a mixture is also treat...
متن کاملNew Approaches to Importance Sampling for Portfolio Credit Risk Valuation by Zhe Wang
New Approaches to Importance Sampling for Portfolio Credit Risk Valuation Zhe Wang Master of Science Graduate Department of Computer Science University of Toronto 2015 Portfolio credit risk based on the Gaussian Copula model has been widely studied and generally evaluated through Monte Carlo simulations. The two-level structure, namely systematic factors and individual factors, complicates the ...
متن کاملOn Inclusion-Driven Learning of Bayesian Networks
Two or more Bayesian network structures are Markov equivalent when the corresponding acyclic digraphs encode the same set of conditional independencies. Therefore, the search space of Bayesian network structures may be organized in equivalence classes, where each of them represents a different set of conditional independencies. The collection of sets of conditional independencies obeys a partia...
متن کامل